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Abstract 



In several applications such as databases, planning, and sensor networks, parameters such 
as selectivity, load, or sensed values are known only with some associated uncertainty. The 
performance of such a system (as captured by some objective function over the parameters) is 
• significantly improved if some of these parameters can be probed or observed. In a resource 

^"*^ | constrained situation, deciding which parameters to observe in order to optimize system perfor- 

mance, itself becomes an interesting and important optimization problem. This general problem 
| is the focus of this paper. 

One of the most important considerations in this framework is whether adaptivity is required 
for the observations. Adaptive observations introduce blocking or sequential operations in the 
system whereas non-adaptive observations can be performed in parallel. One of the important 
£N| ' questions in this regard is to characterize the benefit of adaptivity for probes and observation. 

We present general techniques for designing constant factor approximations to the optimal 
observation schemes for several widely used scheduling and metric objective functions. We 
show a unifying technique that relates this optimization problem to the outlier version of the 
corresponding deterministic optimization. By making this connection, our technique shows 
constant factor upper bounds for the benefit of adaptivity of the observation schemes. We show 
that while probing yields significant improvement in the objective function, being adaptive about 
the probing is not beneficial beyond constant factors. 

sJ ■ 1 Introduction 

a, 

Consider a measurement scenario such as sensor networks or spectrograms, where we have errors 
in estimation, but the distribution of the error is understood through models of the measuring 
instrument, historical data, regression, Kalman filters, etc. For example, suppose we have a set 
of data-points and are trying to construct a classification of them using the minimum spanning 
tree (MST). Suppose that we cannot resolve all the errors but we can repeat and refine a few of 
the measurements, i.e, "probe" a few points to make detailed measurements. A natural question 
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is which measurements do we refine, say if we wanted to refine only k of them and wanted to 
optimize the expected cost of the MST? In many situations, we can assume that the errors made 
by different instruments are independent of each other - even then the problem is NP-Hard |15| 
for computing very simple functions such as the minimum of the readings. In this paper we focus 
on general combinatorial optimization problems. 

The above is a typical example of a model driven optimization problem [15j . This has gained 
significant currency in a variety of other research areas such as database query optimization and 
route selection in networks. In a database query optimization setting suppose the optimizer is 
presented with a set of sufficiently complicated and unrelated (independent) queries. The query- 
optimizer can estimate the resources needed by the queries from historical information, cached 
statistics, sampling of various sub-queries, or by performing inexpensive filters [3j[8j[9]. Subsequent 
to this estimation, the query optimizer schedules the tasks to optimize the throughput or the average 
completion times. However, this estimation process itself consumes resources such as time, network 
bandwidth, and space, and therefore the decision to choose the parameters to refine in estimate 
becomes a key optimization problem. A similar problem arises in networking, where the current 
state of multiple routes and servers can be probed and observed before deciding which route/server 
to use for a specific connection [U EU [20]. The above examples can also be extended to planning 
over a network, where we may wish to visit a sequence of nodes, about which we have imprecise 
information and can only refine a small number of them, to minimize total distance traveled - such 
problems arise in query processing in sensor networks |12j . 

Problem Statement. The above problems of measurement refinement or optimizing independent 
query schedules can be formulated abstractly as follows. Let X\,X2, ■ ■ ■ ,X n be non- negative inde- 
pendent random variables, whose distributions are given as inputs (the distribution can be specified 
by samples and does not affect any result in this paper). Let [n] denote {1,2, . . . , n}. For subset 
SQ [n], let vs denote the realization of {Xi, i G S}. We are given a minimization problem h(o; v), 
where v G denotes the inputs, and o G O denotes the output. 

As an example, in weighted completion time scheduling on a single machine, 1|| ^ ■ WjCj, job 
i has size that follows distribution Xi, and has weight iOj. The vector v denotes a realization of 
job sizes, the space O is the set of all orderings of jobs, and the function h(o; v) = J2i w i J2j^ i v j' 
where j -< i means job j appears before i in the ordering o. Note that if the goal is to solve 
min og o E v [/i(o; v)], the optimal o simply orders the jobs by Smith's rule in decreasing order of 
Efx-l " ^ * ne S oa l i s instead to solve E v [min og e) h(o; v)], for each realization v, the ordering would 
be in decreasing . Our goal will not be to solve either of these extreme versions. 

Our goal will be to solve the probing version of these problems. Here, each variable i G [n] 
has an observation cost q. The exact value of the underlying random variable Xi can be found by 
spending this cost. There is a budget C(> maxjCj) on the total cost of observation. The probing 
policy chooses a subset of variables of total cost at most C to probe and observe. Subsequent to 
this, the optimization policy chooses a solution based on the outcome of the probes. We note that 
the solution is fixed after the outcomes of the probes, but before the realization of the unprobed 
variables. Therefore, given the set of probed variables and their outcomes, the solution is the same 
for all realizations of the unprobed variables. The final goal is to devise a policy (or strategy) for 
adaptively choosing that S C [n] of variables so that E vs [min oe o E Vg [h(o; vgr, vg)]] is minimized. 
The expectation is jointly over outcomes of probes and the distributions of unprobed variables. 
Here, v$ denotes the realization of the random variables {Xi,i G S}, and S denotes [n] \ S. 

In the above completion time example, let ELYj] = fa. If the probing policy chooses a subset S 
of jobs to probe, after the probing, the optimal ordering is Smith's Rule using the exact sizes Vj for 
the probed jobs and the expected sizes fa of the unprobed jobs. For a given realization vs of the 
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variables {Xi,i £ S}, the expected completion time (where the expectation is over the unprobed 
job sizes) is: 



nmE Vs [/i(o;v s ,Vs)]= 



min(wiVj,WjVi) 




The value of the overall solution given S is probed is therefore: 



E vs minE V5 [/i(o; v s , v 5 )] 



(1) 



The objective of the probing strategy is to adaptively choose a subset S to probe, so that this value 
is minimized, subject to Ylies Ci — ^ ■ Note that such a strategy is a decision tree with the next 
decision to probe depending on the outcomes so far. At a leaf I of the decision tree, a subset Si 
has been probed and realized; the job ordering at the leaf would have weighted completion time 
given by Equation ([1]) with S = S[. The expected value of this quantity over the decision tree is 
the value of the probing strategy. The goal is to design a polynomial time algorithm for finding 
the optimal probing strategy. 

Conceptual Issues. Several conceptual and systems issues arise immediately. 

• The most important consideration for a system is quantifying the benefit of being adaptive 
in the probing policy. In the above formulation, the probes could possibly all be done in 
parallel upfront, i.e., non-adaptively, and the results of the probes are ignored in deciding 
which other variables to probe. In contrast, adaptive observations are sequential, based on 
the outcomes of the previous probes. It is clear that adaptive strategies yield as good or 
better solutions than the non-adaptive counterpart. The first issue with an adaptive strategy 
is the complexity of expressing the solution; it is a priori not clear that the decision tree that 
encodes the optimum probing policy (or strategy) is polynomially bounded, so that it is not 
clear if the problem of finding the optimal adaptive probing policy is even in NP. 

• The next interesting issue, which is a consequence of the adaptivity, is the budget. Note 
that while probing, if a suitable value is found already, then the probing can be halted 
early, yielding significant cost savings. Thus it makes sense to assume that the budget is in 
expectation as well. This sets up two classes of problems, namely with hard budgets, where 
the probing budget is C is on all decision paths, and with soft budgets, i.e., the budget C is in 
expectation over the decision paths, so that on some paths, the cost of the variables probed 
could exceed C if this is compensated by halting early on other decision paths. Note that 
this distinction does not arise for non-adaptive policies, where the probes are done upfront in 
parallel. 

Results. In this paper, we design poly-time algorithms for finding approximately optimal non- 
adaptive probing strategies for the large class of scheduling and metric objective functions that have 
been considered in the known applications and literature to date, namely, weighted completion time 
scheduling, minimum makespan scheduling, metric clustering, and Steiner trees. Our results for 
all these seemingly disparate problems use a common underlying technique that also shows that 
these strategies have a bounded adaptivity gap, i.e., there exists a non-adaptive probing strategy 
which is only a constant factor worse compared to the best adaptive probing strategy with soft 
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budgets. Therefore for the problems we study, the non-adaptive and adaptive models with hard 
and soft budgets are all related to each other by constant factors. As a consequence, one of the 
main points we establish is that although probing helps significanthjQ, adaptive probing is no better 
than non- adaptive probing by more than a constant factor in many problems of interest. 

Related Work. The model-driven optimization framework can be defined for other objective 
functions h. This class of problems was first defined by the authors of |15j . where the non-adaptive 
minimum-element problem, with h = minj Xj is considered. Though the current paper presents 
the best results known for scheduling and metric problems, other work [151 1181 [191 120] has considered 
different objective functions, most notably problems involving subset selection. In [181 119] . we 
considered the case where / is a single constraint packing problem such as knapsack with random 
profits which are observable, and present a 8 approximation based on rounding the solution of a 
natural linear program. As in this paper, the approximation ratio holds even when the optimal 
solution is allowed adaptive (i.e., can be based on the results of previous observations). It further 
holds even when the hidden quantity is a distribution (instead of a single value) and a prior 
on this distribution is specified as input. In [20] the authors consider the Lagrangean version 
of h = maxjXj, where the observations are adaptive and the goal is to maximize the expected 
difference between the maximum value and the observation cost. Note that there is no budget on 
this cost, instead it is part of the objective function. We note that the techniques needed for subset 
selection problems are very different from those needed for the scheduling and metric problems 
considered in this paper, where all variables whether probed or unprobed eventually are part of the 
solution. 

The notion of refining uncertainty has been considered in an adversarial setting by several 
researchers |31 [ I13 [ [271 [4], Here, the only prior information about an input is the lower and upper 
bounds on its value. The goal is to minimize the observations needed to estimate some function 
over these inputs exactly, and often strong lower bounds arise. For correlated random variables, 
the problem of minimizing residual information is considered in [29j . 

The adaptivity gap in the absence of probes has been considered in the literature earlier, notably 
in |10[ \TT\ [30l [35] for Knapsack and scheduling problems. However, the model driven optimization 
problem is considerably different from the settings in those papers. In [101 [TT1 137)], 155] the optimum 
is allowed to decide on the next item to schedule based on the past, but once the next item is 
decided this is an irrevocable commitment. In contrast, in our problem, after probing we (as well 
as the optimum adversary) may choose not to include an item in the Knapsack, arbitrarily (re) order 
the schedule, come up widely different clustering depending on the outcome of the probed values. 
Hence, we need completely different arguments and observation schemes in the absence of the 
irrevocable commitment. 

The classic stochastic optimization (non-adaptive, non-probing) versions of these scheduling 
problems were considered in [281 116] . We also note that stochastic optimization problems were 
considered in [261 1221 [251 EH ESI (221 [23] ; these problems appear to be unrelated to adaptivity gaps. 

2 General Framework 

The types of objectives we are interested in this paper are scheduling and metric problems. For 
these problems, the key feature is that the final solution has to be constructed over a^the variables 
whether probed or unprobed. Our main contribution is to identify a key recombinant property 
and an uniform solution recipe that shows constant factor adaptivity gap for all problems which 

1 We show in Section [3] that probing does indeed improve the objective of 1|| WjCj by a polynomial amount. 
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satisfy the property. We present this general technique below, and adapt it to specific problems in 
the subsequent sections. 

2.1 Problems Considered 

Before presenting the general algorithmic framework and analysis using the recombinant property, 
we summarize our problems and results. 

Scheduling Problems. For scheduling we consider the average completion time (1| | ^ ■ WjCj) and 
minimum makespan on identical machines; in both cases, the job sizes are random variables. We 
focus on the weighted completion time which illustrates the main issues (Section [3]). A surprising 
feature of this algorithm is that only the expected values of job sizes are used in constructing the 
approximately optimal probing scheme. The makespan problem is discussed in Section 0J For 
both these problems we design non-adaptive algorithm which shows that the adaptivity gap is a 
constant. 

Metric Problems. We next consider several optimization problems in metric spaces. We assume 
that the input nodes are discrete distributions of polynomial specification over points of the metric 
space. For these problem a small but important added twist is needed - we need to reformulate the 
problem on a different but related metric. We consider the A;-Median and Steiner Tree (or TSP) 
problem. We discuss the /c-Median problem and the general setup for these problems in Section [5j 
The MST problem is discussed in Section [U For both these problems we design non-adaptive 
algorithm which shows that the adaptivity gap is a constant. 

For the /c-median problem an interesting issue comes to fore, which does not appear for deter- 
ministic input. It is well known that over any space, there exists a fc-median solution which uses 
the input points and is at most twice the optimum. For distributional input we show that the 
adaptivity gap for obtaining exactly k medians is polynomially large, short of a polynomial blowup 
in the probing cost. However, if we restrict all solutions to use fixed points in the metric space as 
medians (as opposed to declaring an input node which could possibly be a distribution over points 
as a median) then the gap disappears! This exposes an interesting contrast in the problem based 
on which points are allowed to be medians. We expect these definitional issues to be of independent 
interest as more problems with distributions as input are investigated. 

2.2 The Non-adaptive Algorithm 

For S C [n], let Os denote the space of solutions constructed on the subset S of input variables. Let 
O = C[ n j. For instance, in weighted completion time scheduling, Os denotes the space of orderings 
of the jobs in subset S. Given a realization va of variables {Xi,i G A}, let min oe e> A Ha(o, va) 
denote the optimal objective using only these realized variables. 

Definition 2.1. The outlier problem Q*(C) is: Choose a subset S of variables so that Ylies c « — C 
as outliers (i.e., to ignore), so that min oe e> 5 E Vg [h§(o;v§)] is minimized. Here, S = [n] \ S. 

As an example, in weighted completion time scheduling, Q*{C) would correspond to choosing 
a subset S of jobs with total probing cost at most C as outliers (i.e., to ignore) so that for the 
remaining jobs S, the expected completion time (with no probing), Yli jes j>i w jl I i ^ s minimized. 
Here, m = E[X;]. 

Our non-adaptive strategy has the following simple structure: 

1. Solve the outlier problem Q*(C). Suppose this outputs a subset S C [n] of as outliers. 
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2. The non-adaptive algorithm probes S, and for the realization V5 of these variables, subse- 
quently solves min oe e) E Vg [h(o; V5, v^)] on the observed and unobserved variables. 

As an example, in weighted completion time scheduling, suppose the outlier problem returns a 
subset S of jobs as outliers. The probing strategy observes the sizes of these jobs. It then sorts the 
jobs in decreasing order of and schedules them, where k = m if i G S, and U = Vj if i G S*. 

2.3 Analysis 

The guarantees we obtain will depend on the approximation ratio achievable for Q*(C) and for 
solving min og £) E v ^ [h(o; V5, Vg)]. We will require two properties about the problems we consider. 
The first property is downward closure. 

VoG [n] ,AC [n], 3o A £O a s.t. Vv [n] h A (o A ; va) < h(o; v [n] ) (PI) 

The above property trivially holds for all the problems we consider: weighted completion time 
scheduling and minimum makespan scheduling where v denotes job sizes; and geometric /c-medians 
and Steiner trees, where v denotes locations of points in a metric space. 

Using (PI), we will show a lower bound for the adaptive optimal solution in terms of the outlier 
problem. In particular, we relate the quantity Q*{C) to the values OPT s and OPTh of the optimal 
adaptive solutions with soft and hard probing budgets C respectively. Recall that in the hard 
budget version, the probe cost has to be at most C on all adaptive decision paths, whereas in 
the soft budget version, it is only required that the expected probe cost be at most C. Clearly, 
OPTh > OPT s since the soft budget version relaxes the hard budget constraint. 

Lemma 2.2. Q*(C) < OPT h , and <?*((1 + (3)C) < (l + OPT s for all p > 0. 

Proof. First consider the optimal decision tree with hard budgets. At each leaf I of this tree, the 
probing cost is at most C. Let S denote the subset of probed variables corresponding to this leaf 
node, and denote the values observed for these variables. Let opti denote the objective function 
at this leaf node. By property (PI), we have: 

opU = minE Vs , [h(o;v s ,v § )] > mm E Vs , [h(o § ;v § )] > Q{C*) 

Therefore, opU > Q*{C). Since this is true for all leaves /, we have OPT h > Q*{C). 

Next consider the optimal decision tree with soft budget C. Consider all leaves I with probing 
cost at most (1 + /3)C. By Markov's inequality, the probability of such a leaf is at least j^. For 
each of these leaves, opt\ > Q*((l + P)C). Therefore using (PI), we have: 

oPT 8 >JLpg*({\ + p)C) => g*{{\ + i3)c)<{i + ^j-oPT s 

□ 

Recombinant Property. The next and more non-trivial property relates to combining the so- 
lutions for the probed and unprobed parts. This property crucially requires that the random 
variables X-i are independent, and holds for all product distributions. In the inequality below, the 
expectations are over v^. 
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VA C [n],v A ,mmE[h(o;\r A ,v A )] 



<p min 1ia(o;va) + m i n ^[^(o; v^)] + E[min h(o; v^, v^)] 
.oeo^ oeOi ' oeo 



(P2) 



At a high level, the recombinant property holds if for an arbitrary partitioning of the input 
variables, each of the parts have an induced solution with the property that the objective function 
values sum to no more than the respective quantities in the original problem; further, the two 
solutions can be combined without a significant increase in value of the resulting solution. Since 
encoding the value of the probed part is unwieldy, this partitioning is necessary to construct a 
solution that encodes just the unprobed part. The recombination shows a small adaptivity gap - 
the tricky part is to ensure that the interaction of the probed and unprobed parts can be bounded. 
This technique also yields non-adaptive probing strategies in addition to the adaptivity gap proof. 

For all the problems we consider, we will show that p is a constant. Using this, we can now derive 
the approximation ratio for the non-adaptive strategy. Suppose Step (1), the outlier problem, has 
approximation ratio 71, and Step (2), solving min og e> E Vg [h(o; vg, v<j)] given vg has approximation 
ratio 72, then we have the following result: 

Theorem 2.3. The non-adaptive algorithm finds a solution of value at most P7i(2 + j2)OPTf l . 
It further finds a solution of value P7i(2 + 72) ^1 + jjj OPT s using outlier cost (1 + f3)C for any 
> 0. 

Proof. Given the computed set of outliers 5 and the final solution v Vs for the hard budget case, 
we have the following sequence of inequalities: 



E vs [E vg [/*K s ;v Sj vg)]] 
< 7 2 E VS 



mfoE V5 [h(o;v s ,v § )] 



< 72PE VS 

< 2 72 pE v 



min h A (o; v 5 ) + min ~E[h s (o; v s )] + E[min h(o; v s , vg)] 



min h(o: v) 



< 2 72 pE v 

< 72P(71 + 2)OPT h 



min h(o: v) 

oeo 



+ 72PE VS 
+ 72P7i^*(C) 



min E[h s (o;v § )\ 

oeo g 



The second inequality follows from Property (P2), the third from Property (PI), and the final 
inequality from Lemma 12.21 The proof for the soft budget case is identical. □ 



Definition 2.4. We will denote rj = 72/0(71 + 2) as the approximation ratio of the problem. 

The above guarantees imply bicriteria approximations (where the probing cost also increases by 
a constant factor) for the soft budget versions of all problems, and for the hard budget versions of 
the metric problems we consider. The increase in probing cost is unavoidable using our techniques, 
since the best known algorithms for the outlier versions of the metric problems we consider have 
similar gaps. An interesting open question is to show a complexity result that the increase in 
budget is unavoidable. 
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3 Scheduling I: Weighted Completion Time on Single Machine 



We first consider the weighted completion time problem (1| | ^ WjCj) of scheduling jobs on a single 
processor to minimize the sum of the weighted completion times. All jobs are released at time t = 
and there are no deadlines or precedence constraints. 

In the probing model the sizes (or processing times) of jobs J\, . . . , J n are distributed according 
to independent random variables X±,X2, ■ ■ ■ ,X n respectively. The weight of job J« is Wi, which is 
not a random variable. Let EpQ] = /ij. Each variable Xi corresponding to the size of job Jj has 
probing cost a; probing yields its exact value. Let C denote the (soft) budget on probing cost. 

The solution is a strategy for adaptively probing a subset the jobs so that the expected weighted 
completion time of scheduling all the jobs after the outcome of the probes is known, is minimized. 
This expectation is over the outcome of the probes, and over the distribution of the processing 
times of the unprobed jobs. We note that the scheduling policy fixes the ordering o G O of all 
jobs after the results of the probes (say set S) are known, but before the sizes of the unprobed 
jobs are revealed. Therefore, the optimal scheduling policy whose value is min og E Vg [h(o; vg, vg)] 
will simply order the jobs in decreasing order of weight to the ratio of (expected) processing time 
(Smith's Rule) - this processing time is exactly known for probed jobs S, and is the expected 
processing time for unprobed jobs Sq 

Benefit of Probing. To gain intuition, we present a simple example where probing helps by a 
factor of f2(n). There are n jobs with unit weights and unit probing costs, and sizes which are 
w..p. 1 — 1/n, and 1 with the remaining probability. If no jobs are probed, each job has expected 
size 1/n, so that the expected completion time of any ordering is Yj7=i V n = ^ir~- If au j° DS are 
probed, the jobs which are of size 1 can be placed after jobs of size 0. If there are k jobs of size 1, 
the completion time is M^hli_ xhe variable k follows Binomial(n, 1/n), with mean 1 and variance 

at most 1. This implies E[fc 2 ] < 2, so that E k ( k + l "> < |. Therefore, probing yields a O(re) benefit. 

Main Result. Recall that p is the approximation for Property (P2), 71 is the approximation ratio 
for the outlier problem Q(C*), and 72 is the approximation ratio for solving min og o E Vg [h(o; vg, Vg)]. 
We will show the following theorem. 

Theorem 3.1. For the weighted completion time scheduling problem, p = 2, 71 = 3 using cost 
budget 2>C , and 72 = 1. This implies an approximation ratio of n = 10 for the non-adaptive probing 
strategy that uses cost budget 3C. 

The proof that 72 = 1 is simple, since the optimal scheduling policy whose value is min oe o E v§ [h(o; vg, vg)] 
will simply order the jobs in decreasing order of the ratio of weight to (expected) processing time 
(Smith's Rule) - this processing time is exactly known for probed jobs S, and is the expected 
processing time for unprobed jobs S. We will first show p = 2, and then design a 3-approximation 
for the outlier problem that violates the cost budget by a factor of 3. 



3.1 The Recombinant Property (P2) 

We first show Property (P2) holds for p = 2. Consider n jobs with deterministic lengths. Let 
h,h, ■ ■ ■ ,ln denote the jobdengths, and w\, Wn, ■ ■ ■ , w n denote the job weights. Let = . By 
Smith's rule, the optimal solution sorts the jobs in increasing order of ctj and schedules in this order. 

2 This is verified by estimating the benefit/loss of an exchange of consecutive jobs on each scenario and then 
aggregating over the scenarios, see also [32] . 
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The completion time of the optimal ordering can therefore be written as WiWj min(aj, ctj). 

i=l j>i 

We have: 

Lemma 3.2. For any partitioning of n deterministic jobs into two disjoint sets A and B, we have: 
WiWj min(aj, ctj) + WiWj min(aj, ctj) > WiWj min(a!j, ctj) 

i,jeA,j>i i,j&B,j>i ieAJ&B 

Proof. Let 7^ = min(aj, ctj). First, suppose all 7^ = 1. We have: 

(Y,ieA w i-Y,jeB w j) ^ 

\ieA ) \jeB j 
y--J- Aj • £ij< ".j •,"•'''•7 > S W<W J 

We next consider the case of general ccj. We will prove this by induction on the number of 
jobs (the base case being trivial). Let i* = argmimaj. For each job i, let = a% — cti*, and let 
Sij = mm(/3i, (3j). We have: 

WiWjlij = WiWj{a>i* + Sij) 

From the proof of the 7^ = 1 case, we have: 

a>i* w i w j + /2 w i w j > «i* ^ WiWj 

\i,jeA,j>i i,jeB,j>i ) i£A,j<=B 

The set of jobs with non-zero (3 values is strictly smaller than n. Let Z be the set of jobs with 
fa = 0. By the inductive hypothesis we have: 

WiWjSij + ^2 mwj Sij > ^2 WiWj Sij 
i,jeA\z,j>i i,jeB\z,j>i ieA\zjeB\z 

Adding the previous two inequalities, we have the proof of the lemma. □ 

In the above lemma, the LHS represents the contribution to the optimal completion time which 
arises from job pairs within A and within B. The RHS represents contributions of job pairs such 
that one of the jobs is in A and the other in B. The above shows that the interaction term across 
the two sides of any partition, can be bounded by the sum of the interactions within each side. 
This directly shows the recombinant property (P2) with p = 2: 



Lemma 3.3. 



VA C [n],v A , minE[/i(o; v A ,v A )] < 2 
where all expectations are over v A . 



min /i A (o; v a ) + mm E[/i A (o; v A ) 
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Proof. Let m = E[Xi\. We have: 



mm fiA(o;v a) = / mm(wjVi, w{Vj) 
min E[h^(o; v^) = mxa.(wj /h, Wi/ij] 

I ,J6AJ>! 



Furthermore, we have: 



minE[/i(o; v A , v^)] 

= min(wjVj, ^jVj) + min(u>j/ifj, lUj/ij) + min(iOjVi, lUj/ij) 

i>3&A,j>i i,j£A,j>i i€A,jeA 

= min Ha(o;va) + niin E[/i^(o; v^) + N min(w,Vj, Wi/J,j) 

By Lemma [321 we have: 

^ min^-Vi^i/ij) < V] min^jVijiyjVj) + V] mm(wjfj,i,WiHj) 

ieA,jeA i,j&A,j>i i,j&A,j>i 

Putting the above two inequalities together completes the proof. □ 



3.2 Approximation Algorithm for the Outlier Problem 

To complete the proof of Theorem 13. 1\ we finally focus on solving the outlier problem Q*(C). 
Consider the following integer program for the outlier program. Here, variable z% £ {0, 1} is set to 
1 if job i is in the outlier set. Variable £ {0, 1} is set to 1 if jobs i and j are both not in the 
outlier set. 

n n 

min ^2 ^2 e ij mm{wjm, Wifij} + ^(1 - Zi)wim 

i=l j>i i=l 
n 

eij + %i + Zj > 1 for all i,j>i 

eij,Zi £ {0, 1} for all i, j G {1,2, . . . ,n} 

It is easy to check that this integer program encodes the problem Q*(C). To see this, note that 
the objective is precisely the weighted completion time of the jobs not in the outlier set. The first 
constraint encodes the cost constraint on the outlier set, and the second constraint encodes that if 
jobs i and j are both not in the outlier set, then = 1, but if Zj or Zj is 1, then = 0. 

We solve the linear relaxation of the LP, where the final constraint is replaced with e(i,j) > 
and Z{ > 0. Now round the LP solution as follows: Let S = {i\z{ > |}. Set Z{ = 1 if i € S, and set 
eij = 1 if i S and j S. The set S is the outlier set output by the algorithm. 

Theorem 3.4. The outlier problem Q*{C) admits to a 3- approximation that violates the cost budget 
C by a factor of 3. 

Proof. It is easy to check that z% < 3zi and e~ij < 3eij. To see this, simply observe that = 1 iff 
Z{ < 1/3 and Zj < 1/3, in which case e,j > 1/3. Therefore, the objective is within a factor 3 of the 
LP objective, and the first constraint is violated by a factor of 3. □ 
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4 Scheduling II: MAKESPAN on Identical Machines 



We now consider the problem of minimizing the makespan on identical parallel machines. In 
this problem, there are m identical machines, and n jobs, whose sizes are random variables, 
X\, X2, ■ ■ ■ , X n . We can probe job i by spending cost q, and find the exact value of its pro- 
cessing time. Given a bound on the total query cost C, the goal is to find the subset of variables 
to probe so that the expected value of the makespan (i.e. the load on the most loaded machine) is 
minimized. Here, the expectation is over all realizations of the sizes of the unprobed jobs. 

Recall that p is the approximation for Property (P2), 71 is the approximation ratio for the 
outlier problem Q(C*), and 72 is the approximation ratio for solving min og £) E Vg [h(o; vg, vg)]. We 
will show the following theorem. 

Theorem 4.1. For the minimum makespan scheduling problem, we have 71 = 0(1), 72 = 0(1), 
and p = 1, so that the approximation ratio of the non-adaptive strategy is 77 = O(l). 

Recombinant Property. It is easy to see that for any A E [n], for all o\ G Oa and 02 S O^, the 
concatenation of these solutions (call it o E O) satisfies: 

h(o; v) < h(oi; va) + h(o 2 ; V4) Vv 

This directly implies Property (P2) with p = 1. 

Outlier Problem. The outlier version Q*(C) is the following: Find a subset S of jobs with cost 
at most C to discard so that the optimal expected makespan on the remaining jobs is minimized. 
We therefore consider the problem V of scheduling the jobs in the absence of probing. The goal is 
simply to assign the jobs to the machines so that the expected makespan, which is the expected load 
of the most loaded machine, is minimized. Here, the expectation is over all possible realizations of 
the job sizes. This problem was addressed by Kleinberg, Rabani, and Tardos [28J, who show a nice 
characterization. 

Definition 4.2. Given a random variable Xi, define the random variable Yn,Zit as follows 

Y f Xi/t ifX t <t z = f ifXi<t 

l ' [0 otherwise l ' \ Xi otherwise 

In other words Y is the contribution of X below a threshold t and Z is the contribution of X above 
t; naturally Xi = t ■ Y{j + Zi f. Let 

Yi. 



'—' \ m log to / 



Theorem 4.3 (|28j). Given a set of jobs whose sizes follow independent random variables {Xi}, 

• If^2i fi(t) > t/3 then any scheduling of the jobs on m machines has expected makespan Q(t). 

• IfYli fi(t) then the following algorithm produces a schedule with (expected) makespan 
0(t): 

1. Define the "effective size" of job i to be f}i(t) = log f}™ m ' - • 

2. Consider the jobs in arbitrary order, and place each job on that machine where the sum 
of the rjj(t) of the already scheduled jobs j is minimum. In other words, use Graham's 
rule on r}j(t). 
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The following algorithm solves Q*(C) to a 0(1). Try different values of t in increasing powers 
of (1 + e). For each t, decide if there is a subset S of jobs so that: (1) X^eS c * — ^> an< ^ (^) 
^2i^s fi(t) — By Theorem 14.31 the smallest such t (call it t*) for which the decision problem 
returns a "yes" answer is the desired O(l) approximation to Q*(C). It is easy to see that t* will 
be at most the sum of the maximum values the Xi can take. Therefore, since the distributions are 
discrete and specified as input, computing f%{€) takes polynomial time for any t of interest. This 
decision problem is a knapsack problem, and can be solved to a (1 + e) approximation on t* without 
violating the cost budget. This shows 71 = O(l). 

To show that 72 = 0(1), we simply schedule the probed jobs using Graham's rule on the ob- 
served sizes. Separately schedule the unprobed jobs using the algorithm from [28], and append this 
schedule to that of the probed jobs. Since the probed jobs are scheduled to a factor 2 approxi- 
mation, and the unprobed jobs are scheduled to a 0(1) approximation (by the result of [28]). this 
shows that 72 = 0(1), completing the proof of Theorem 14.11 

5 Metric Problems I: i\~-median Clustering 

We next consider several metric problems, specifically the fe-median clustering and the minimum 
spanning tree problem on metric spaces. Before defining the problems, we will define the uncertainty 
model. 

5.1 Uncertainty Model for Metric Problems 

We define the uncertainty model as follows. We are given a metric space with point set V, which 
defines a distance function I. The input is a set of nodes V, where the location of node i follows 
an independent distribution Xi over V . Distribution (or node) i £ V has probe cost a; on probing, 
the node resolves to one of the locations in V . Since we can only probe a subset of nodes, but must 
finally construct a solution over all the nodes, it will be helpful to define a metric space over the set 
of points V and nodes V, i.e., over the set M. = V U V. For i,j G V, let D(i,j) = l(Xi,Xj) denote 
the random variable corresponding to the distance between Xi and Xj. Let d(i,j) = E[D(i,j)] 
where the expectation is over the random variables Xi and Xj. 

Claim 5.1. d(i,j) + d(j,k) > d(i,k). 

Proof. For any realization of the values of Xi,Xj, and X^, we have D(i,j) + D(j,k) > D(i,k). 
Taking expectations over the random choices of Xi,Xj, and X^, the claim follows. □ 

Define the following metric space: The vertices are the points A4 = V U V. The distance metric 
is d. For i,j e V, d(i,j) = E[D(i,j)] = E[l(Xi,Xj)]. For i,j G V, d(i,j) = For i G V,j G V, 

define d(i,j) = E[l(i,Xj)]. Using the above claim, it is clear that the function d defines a metric 
space on the nodes VWP. This completes the modeling of uncertainty for the metric problems we 
consider. 

5.2 fT-medians: Problem Statement 

In this problem, we are given a metric space with point set V, which defines a distance function I. 
The input is a set of nodes V, where the location of node i follows an independent distribution Xi 
over V . Distribution i G V has probe cost Cj. The goal is to design an adaptive policy to probe the 
nodes which spends expected cost at most C. After probing, the algorithm opens K centers and 
assigns all probed and unprobed nodes to some center so that the expected distance cost (or value) 
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of the clustering is minimized. This expectation is over the locations of the unprobed nodes. Note 
that after probing, the center selection and assignment policy assigns an unprobed node % to that 
open center w which minimizes the expected distance E[/(Aj,w;)] where the expectation is over the 
random variable Aj. The goal is to design a probing policy whose resulting expected distance cost 
(or value) of A-median clustering is minimized, where the expectation is over the outcomes of the 
probes and the locations of the unprobed nodes. 

We consider two variants of the problem. In the first variant, we assume that the center selection 
policy is restricted to opening centers from a set S Q V . This means the centers can only be chosen 
from points of the underlying metric space. Therefore, for an unprobed node i assigned to a center 
w E V, the expected distance cost is E[Z(A*j, w)], where the expectation is over the random variable 
X%. In the second variant, the centers are allowed to be input nodes, and therefore distributions. 
Now, an unprobed node j can be opened as a center after probing a set of nodes. Suppose an 
unprobed node i is assigned to this center, then the expected distance cost is E[Z(Xj, Xj)], where 
the expectation is over both the random variables Xi and Xj. 

We present a constant factor adaptivity gap for the former variant. We then show that the 
adaptivity gap is polynomially large for the latter variant. This shows a fundamental difference in 
the two variants. 

5.3 Fixed Centers 

We first consider the first variant where the centers can only be points from S C V. Recall that p is 
the approximation for Property (P2), 71 is the approximation ratio for the outlier problem G(C*), 
and 72 is the approximation ratio for solving min oe o E Vg [h(o; vg, v^)]. We will show the following 
theorem. 

Theorem 5.2. For the K-median problem with fixed centers, p = 5, 71 = 5 with outlier cost 5C, 
and 72 = 3 + e. This shows an approximation ratio of rj = O(l) with outlier cost 5C. 

As shown in Section 15.11 first define the new metric space d over the points V U V. Given the 
set of probed values vg C V 7 and the unprobed nodes S C V, the problem reduces to deterministic 
A-medians on vg U S C V U V in metric space d, and this shows 72 = 3 + e using the algorithm 
in [2]. In the outlier problem, the goal is to find the subset T C V of nodes of total probing cost 
at most C such that the cost of if -median clustering of the remaining nodes V \ T in metric space 
d is minimized. This problem has a 71 = 5 approximation due to the authors of [6] if the outlier 
cost is relaxed to 5C. If all probing costs are unit, we have 71 = 0(1) approximation that does not 
relax the outlier cost [7|. The only part left to show is the recombinant property (P2). 

Lemma 5.3. 

\/A C [n], va, minE [h(o; va, v^)] < 5 • min E[/i^(o; v^)] + 4 • E[min/i(o; v^, v^)] 
where the expectations are over v^. 

Proof. The quantity Q\ = min oe o E [h(o; v^, v^)] on the LHS involves constructing the optimal 
if-median solution using the actual realization of the probed set A, and the expected locations 
of the unprobed nodes A. To bound this cost, we first construct the optimal K medians on the 
unprobed nodes A by paying Q2 = m.in ^o A E[/t^(o; v^)]. Next, move the unprobed nodes to the 
assigned median. For i £ A, let 4>(i) denote the assigned median. 

We will now proceed by considering scenarios va,v^ of the values of all nodes. Fix some such 
scenario a. Let M a denote the value of the optimal A'-median solution in this scenario. The 
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expected value of this optimal solution over the realizations of is Q3 = E[min oe o h(o; v_a, v^)]. 
Let P' C V denote the set of points va, along with the nodes i G A located at point <j)(i). Note 
that the set P a is the same for all realized a. Since is fixed, and the mapping is independent 
of the scenario, we have P' is independent of the scenario a. 

We will now construct the solution corresponding to the quantity Q\ using the solution for Q2 
and the solutions M a . In scenario a, send each point in A its assigned center in the optimal solution 
M a . Similarly, send each node % 6 A located at 4>(i) back to its realized location in scenario a and 
from there to its assigned center in M a . This yields a mapping from the nodes P' to K medians 
in scenario a. The expected distance value (over scenarios a) of this mapping is at most Q2 + Q3 
by triangle inequality. Note that the distances in this mapping are distances between points in V, 
and do not involve the distributional nodes V. This yields a valid K- median solution on P'. 

Since there is a feasible if-median solution for each <r, these when averaged over a define a 
fractional K-median solution for the set of points P' of value at most Q2 + Qz- Therefore, there 
is an integer K- median solution on P' of value 4 times this fractional value [5] . The final solution 
corresponding to Q\ maps the points i 6 A first to 4>(i) paying cost Q2, and then maps all the points 
to these constructed centers paying cost 4((^2 + Qs)- This shows Q\ < 5Q2 + 4Q3, completing the 
proof. □ 



5.4 Arbitrary Centers 

Consider now the case where the centers themselves are allowed to be input nodes, and therefore 
distributions. After probing, the center selection policy could decide to open an unprobed input 
node as a center, and assign probed and unprobed nodes to this center. The distance cost between 
the center and the assigned node is the expected distance between them, where the expectation 
is taken over possible locations of the center and the assigned node. We note that the linear 
programming relaxation is still a lower bound on the adaptive optimal solution. 

By constructing separate K-median solutions for the probed and unprobed variables, it is easy 
to show a O(l) approximation that uses 2K centers and pays probing cost 5C. We show that 
this is the best possible result for this case in the following sense. We show that any non-adaptive 
algorithm has polynomially large adaptivity gap on both distance and probing cost if it is restricted 
to opening at most (1 + e)K centers for some constant e > 0. Intuitively what fails in reducing the 
IK centers to K centers is the following: In the proof for fixed centers, we mapped the unprobed 
nodes of the non-adaptive solution to a set of nodes in V, and constructed a fractional solution 
using these locations for these nodes. This ensures that the metric space is over V and independent 
of distributions. In the current setting, an unprobed node could be mapped to an unprobed center. 
Therefore across scenarios, the metric space over the locations of the nodes changes. 

Theorem 5.4. The adaptivity gap for K -medians when centers can be input nodes is polynomially 
large on both distance and probing cost. 

Proof. Consider M distinct copies (at a mutual distance of at least M 3 L from each other, L = M 2 ) 
of the following 2-dimensional node set. In copy m, there are r + 1 "cheap" nodes X\ m , X2 m , ■ ■ ■ 
which cost 1 to probe. Distribution Xi m is (0,0) with probability 1/2 and (i + 1,0) otherwise. 
In addition, there are t S> vM pairs of nodes which are well-separated from other pairs by a 
large distance L. Pair j corresponds to two distributions: Yj m and Zj m . Yj m is (L + jL, 1) with 
probability 1 — log t/t and (L + jL, 0) with probability logt/t. Zj m is (L + jL, —1) with probability 
1 — logt/t and (L+jL, 0) with probability logt/t. These distributions are "expensive" with probing 
cost (r + 1)M, where r > 2 log Mt. Again note that the nodes for each m are far removed from the 
nodes corresponding to other m. 
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Let K = (2t + r)M. For each to, the adaptive solution will place 2t + r medians using the 
following strategy. First probe all cheap distributions X*. There are two cases: 

Case 1. For some m, X* m resolve to r + 1 values distinct points. This happens with probability 
l/2 r+1 for a particular m and therefore ^ftt overall. In this case, probe all the "expensive" 
distributions Y* and Z*. Choose the r + 1 realized points X*, and some 2t — 1 of the remaining 
2t points as medians. With probability 1 — (1 — logt/t) 2t > 1 — 1/t 2 some (Y,Z) pair collides on 
the x-axis. In this case, the k medians solution has distance cost 0, else it has a distance cost of at 
most 2. The expected distance cost is therefore at most (l/2 r+1 ) • (1/t 2 ) ■ 2 for each m. Therefore, 
the overall expected distance cost is p^r. 

Case 2. If at most r distinct values of X* m are observed for all m, then do not probe further, 
since there is a A; median solution of value 0, in which every one of the expensive nodes and all the 
realized locations of the cheap nodes are the medians. 

The expected probing cost of this scheme is at most M(r + l) + (r + l)M -2Mt- t^tt < 4M(r + l). 

Any non-adaptive probing scheme must probe at least one expensive distribution in each copy, 
else the distance cost is at least i^r+i 3> in that copy: In the case where the cheap distributions 
resolve to distinct values w.p. l/2 r+1 , the distance cost will be at least 0.5. Therefore, the probing 
cost needed is (r + 1)M 2 , which implies that unless the probing cost is a factor M larger than the 
adaptive scheme, the distance cost must be 4j times larger. Therefore, no non-trivial adaptivity 
gap is possible. □ 



6 Metric Problems II: Minimum Steiner Trees 

We finally consider the minimum Steiner tree problem - the algorithm extends to Metric Traveling 
salesman problem naturally. As with if-medians, the input is a collection of n nodes. The location 
of node i is an independent random variable Xj, which is a distribution over points V in a metric 
space with distance function I. Let V denote the set of nodes. The exact location of node i is 
determined by spending probing cost Cj. The goal is to design an adaptive probing scheme which 
minimizes the expected cost of connecting the nodes by a spanning tree, subject to the constraint 
that this decision tree has expected probing cost at most C. 

Recall that p is the approximation for Property (P2), 71 is the approximation ratio for the 
outlier problem Q(C*), and 72 is the approximation ratio for solving min og e> E Vg [h(o\ V5, v§)]. We 
will show the following theorem. 

Theorem 6.1. For the metric Steiner tree problem, we have p = 1, 71 = 4 with outlier cost AC, 
and 72 = 1.55. The non-adaptive strategy therefore has an approximation ratio r] = 0(1) with 
probing cost AC. 

Since the overall algorithm is similar to the if-medians problem, we simply highlight the dif- 
ferences. As discussed in Section 15. fl we construct the metric space d over A4 = V U V. The 
outlier problem over the nodes V in the metric space d is defined as follows: There is a cost of 
Ci for every node i E V. The goal is to choose a set of nodes S C V with total cost at most C 
to discard such that the value of the Steiner tree on the nodes in V \ S is minimized. This has 
a 72 = 4-approximation that spends outlier cost AC [IT]. If all probing costs are unit, we have 
72 = 4-approximation |14j that obeys the cost constraint. The minimum cost Steiner tree problem 
on the probed and unprobed nodes has an approximation ratio of 72 = 1.55 [33J. 

To show the recombinant property (P2) with p = 1, we make the following observations. For 
any set A of probed nodes with realization v^, /ia(o;^a) is the value of the optimal Steiner tree 



15 



T\ connecting these realized nodes; min og 0^ E[/i^(o; v^)] is the value optimal Steiner tree T2 con- 
necting the unprobed nodes; and finally, E[min og e> h(o; v^, v^)] is at least the expected cost of any 
edge e = connecting some i 6 A and j G A. Taking the union of T\, T2, and e yields a tree 
whose cost is at least min oe e> E[/i(o; v^, v^)]. 

7 Conclusion 

In this paper, we considered a large class of scheduling and metric problems when the inputs follow 
distributions that can be observed by paying a cost. We showed that the problem of computing 
the optimal budgeted resolution scheme for these inputs (that minimizes the expected value of the 
posterior optimization) is closely related to the outlier version of these problems. This work results 
in several open questions. First, can the approximation ratios be improved by an algorithm that 
designs adaptive resolution schemes instead of non-adaptive schemes? Next, our model for metric 
problems assumes nodes are uncertain and can be resoled at a cost. What if only distances between 
nodes can be resolved at a cost? The problem now is that the distributions for the edge lengths 
cannot be independent random variables, and we would need new techniques. Finally, it would 
be interesting to explore if the outlier scheme can be extended to a larger class of model-driven 
optimization problems. 

Acknowledgments: We thank the anonymous referee for significantly improving the presenta- 
tion of this paper. We also thank Sariel Har-Peled, Ashish Goel and Anupam Gupta for helpful 
discussions. 

References 

[1] A. Akella, B. M. Maggs, S. Seshan, A. Shaikh, and R. K. Sitaraman. A measurement-based 
analysis of multihoming. In ACM SIGCOMM Conference, pages 353-364, 2003. 

[2] V. Arya, N. Garg, R. Khandekar, A. Meyerson, K. Munagala, and V. Pandit. Local search 
heuristics for fc-median and facility location problems. SIAM J. Comput., 33(3):544-562, 2004. 

[3] S. Babu and P. Bizarro. Proactive reoptimization. In Proc. of the ACM SIGMOD Intl. Conf. 
on Management of Data, 2005. 

[4] M. Charikar, R. Fagin, V. Guruswamy, J. Kleinberg, P. Raghavan, and A. Sahai. Query 
strategies for priced information. Journal of Computer and System Sciences, 64(4): 785-819, 
2002. 

[5] M. Charikar and S. Guha. Improved combinatorial algorithms for the facility location and 
k-median problems. In FOCS '99: Proceedings of the 40th Annual Symposium on Foundations 
of Computer Science, 1999. 

[6] M. Charikar, S. Khuller, D. M. Mount, and G. Narasimhan. Algorithms for facility location 
problems with outliers. In SODA '01: Proceedings of the twelfth annual ACM- SI AM symposium 
on Discrete algorithms, pages 642-651, 2001. 

[7] K. Chen. A constant factor approximation algorithm for /c-median clustering with outliers. In 
Proc. ACM-SIAM Symp, on Discrete Algorithms (SODA), 2008. 

[8] F. Chu, J. Halpern, and J. Gehrke. Least expected cost query optimization: What can we 
expect? In Proc. of the ACM Symp. on Principles of Database Systems, 2002. 



16 



[9] F. Chu, J. Halpern, and P. Seshadri. Least expected cost query optimization: An exercise in 
utility. In Proc. of the ACM Symp. on Principles of Database Systems, 1999. 

[10] B. Dean, M. Goemans, and J. Vondrak. Approximating the stochastic knapsack problem: The 
benefit of adaptivity. Math. Oper. Res., 33(4):945-964, 2008. 

[11] B. C. Dean, M. X. Goemans, and J. Vondrak. Adaptivity and approximation for stochastic 
packing problems. Proceedings of SODA, pages 395-404, 2005. 

[12] A. Deshpande, C. Guestrin, S. Madden, J. M. Hellerstein, and W. Hong. Model-driven data 
acquisition in sensor networks. In Proc. of the 2004 Intl. Conf. on Very Large Data Bases, 
2004. 

[13] T. Feder, R. Motwani, R. Panigrahy, C. Olston, and J. Widom. Computing the median with 
uncertainty. SI AM J. Comput., 32(2):538-547, 2003. 

[14] N. Garg. Saving an epsilon: A 2-approximation for the &;-MST problem in graphs. In STOC 
'05: Proceedings of the thirty- seventh annual A CM symposium on Theory of computing, pages 
396-402, 2005. 

[15] A. Goel, S. Guha, and K. Munagala. How to probe for an extreme value. ACM Trans. 
Algorithms (to appear), 2008. Preliminary version appeared in Proc. of the ACM Symp. on 
Principles of Database Systems (PODS), 2006. 

[16] A. Goel and P. Indyk. Stochastic load balancing and related problems. In Proc. of the Annual 
Symp. on Foundations of Computer Science, 1999. 

[17] M. X. Goemans and D. P. Williamson. A general approximation technique for constrained 
forest problems. SI AM J. Comput, 24(2):296-317, 1995. 

[18] S. Guha and K. Munagala. Model-driven optimization using adaptive probes. In Proc. ACM- 
SIAM Symp. on Discrete Algorithms, pages 308-317, 2007. 

[19] S. Guha and K. Munagala. Sequential design of experiments via linear programming. Com- 
puting Research Repository, arXiv: 0805. 2630, 2008. Preliminary version appeared in the ACM 
Symposium on Theory of Computing, 2007. 

[20] S. Guha, K. Munagala, and S. Sarkar. Information acquisition and exploitation in multi- 
channel wireless systems. Computing Research Repository, arXiv:0804 .1724\ 2008. Preliminary 
version appeared in the ACM SIGMETRICS Conference, 2006. 

[21] P. K. Gummadi, H. V. Madhyastha, S. D. Gribble, H. M. Levy, and D. Wetherall. Improving 
the reliability of internet paths with one-hop source routing. In 6th Symposium on Operating 
System Design and Implementation ( OSDI), pages 183-198, 2004. 

[22] A. Gupta and M. Pal. Stochastic steiner trees without a root. Proc. of ICALP, pages 1051- 
1063, 2005. 

[23] A. Gupta, M. Pal, R. Ravi, and A. Sinha. Boosted sampling: Approximation algorithms for 
stochastic optimization. In Proc. of the Annual ACM Symp. on Theory of Computing, 2004. 

[24] A. Gupta, M. Pal, R. Ravi, and A. Sinha. What about Wednesday? Approximation algorithms 
for multistage stochastic optimization. Proc. of APPROX-RANDOM, pages 86-98, 2005. 



17 



[25] A. Gupta, R. Ravi, and A. Sinha. LP rounding approximation algorithms for stochastic 
network design, volume 32, pages 345-364, 2007. 

[26] N. Immorlica, D. Karger, M. Minkoff, and V. Mirrokni. On the costs and benefits of procras- 
tination: Approximation algorithms for stochastic combinatorial optimization problems. In 
Proc. of the Annual ACM-SIAM Symp. on Discrete Algorithms, 2004. 

[27] S. Khanna and W-C. Tan. On computing functions with uncertainty. In Proc. of the ACM 
Symp. on Principles of Database Systems, 2001. 

[28] J. Kleinberg, Y. Rabani, and E. Tardos. Allocating bandwidth for bursty connections. SIAM 
J. Comput, 30(1):191-217, 2000. 

[29] A. Krause and C. Guestrin. Near-optimal nonmyopic value of information in graphical models. 
Twenty-first Conference on Uncertainty in Artificial Intelligence (UAI 2005), pages 324-331, 
2005. 

[30] R. H. Mohring, A. S. Schulz, and M. Uetz. Approximation in stochastic scheduling: the power 
of LP-based priority policies. J. ACM, 46(6):924-942, 1999. 

[31] C. Olston. Approximate Replication. PhD thesis, Stanford University, 2003. 

[32] M. Pinedo. Sceduling: Theory Algorithms and Systems. Prentice-Hall, 1995. 

[33] G. Robins and A. Zelikovsky. Improved steiner tree approximation in graphs. In SODA 
'00: Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms, pages 
770-779, 2000. 

[34] D. Shmoys and C. Swamy. An approximation scheme for stochastic linear programming and 
its application to stochastic integer programs. J. ACM, 53(6):978-1012, 2006. 

[35] M. Skutella and M. Uetz. Stochastic machine scheduling with precedence constraints. SIAM 
J. Comput, 34(4):788-802, 2005. 

[36] C. Swamy and D. B. Shmoys. Sampling-based approximation algorithms for multi-stage 
stochastic. In FOCS '05: Proceedings of the 46th Annual IEEE Symposium on Foundations of 
Computer Science, pages 357-366, 2005. 



18 



